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The only theory of correlation at present available for practical 
use is based on the normal law of frequency, but, unfortunately, this 
law is not valid in a great many cases which are both common and 
important. It does not hold good, to take examples from biology, 
for statistics of fertility in man, for measurements on flowers, or for 
weight measurements even on adults. In economic statistics, on the 
other hand, normal distributions appear to be highly exceptional: 
variation of wages, prices, valuations, pauperism, and so forth, are 
always skew. In cases like these we have at present no means of 
measuring the correlation by one or more “ correlation coefficients ” 
such as are afforded by the normal theory. 

It seems worth while noting, under these circumstances, that in 
ordinary practice statisticians never concern themselves with the 
form of the correlation, normal or otherwise, but yet obtain results of 
interest—though always lacking in numerical exactness and fre¬ 
quently in certainty. Suppose the case to be one in which two 
variables are varying together in time, curves are drawn exhibiting 
the history of the two. If these two curves appear, generally 
speaking, to rise and fall together, the variables are held to be corre¬ 
lated. If on the other hand it is not a case of variation with time, 
the associated pairs may be tabulated in order according to the 
magnitude of one variable, and then it may be seen whether the 
entries of the other variable also occur in order. Both methods are 
of course very rough, and will only indicate very close correlation, 
but they contain, it seems to me, the point of prime importance at 
all events with regard to economic statistics. In all the classical 
examples of statistical correlation ( e.g ., marriage-rate and imports, 
corn prices and vagrancy, out-relief and wages) we are only 
primarily concerned with the question is a large x usually associated 
with a large y (or small y ) ; the further question as to the form of 
this association and the relative frequency of different pairs of the 
variables is, at any rate on a first investigation, of comparatively 
secondary importance. 

. Let Ox , O y be the axes of a three dimensional frequency-surface 
drawn through the mean 0 of the surface parallel to the axes of 
measurement, and let the points marked (x) be the means of succes¬ 
sive aj-arrays, lying on some curve that may be called the curve of 
regression of x on y. Kow let a line, RR, be fitted to this curve, 
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subjecting the distances of tbe means from the line to some minimal 
condition. If the slope of HR is positive we may say that large 
values of x are on the whole associated with large values of y , if it is 
negative large values of x are associated with small values of y. 
Further, if the slope of RR to the vertical be given we shall have a 
measure of a rough practical kind of the shift of the mean of an 
a?-array when its type y is altered. The equation to RR conse¬ 
quently gives a concise and definite answer to two most important 
statistical questions. It is also evident that if the means of the 
arrays actually lie in a straight line (as in normal correlation), the 
equation to RR must be the equation to the line of regression. 

Let n be the number of observations in any r-array, and let d be 
the horizontal distance of the mean of this array from the line RR* 
I propose to subject the line to the condition that the sum of all 
quantities like ndr shall be a minimum, 7.e., I shall use the condition 
of least squares. I do this solely for convenience of analysis; I do 
not claim for the method adopted any peculiar advantage as regards 
the probability of its results. It would, in fact, be absurd to do so y 
for I am postulating at the very outset that the curve of regression is 
only exceptionally a straight line ; there can consequently be no 
meaning in seeking for the most probable straight line to represent 
the regression. 

Let x, y be a pair of associated deviations, let a be the standard 
deviation of any array about its own mean, and let 
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X= a + bY 

be the equation to RR. Then for any one array 

S {x—(a-\-by)Y = S{a?— (a + bY)} 2 = no 1 + 

Hence, extending the meaning* of S to summation over the whole 
surface 

S (nd 2 ) = S {«3 — (a-{-by) } 2 —Sw<r 3 . 

But in this expression S(W 2 ) is independent of a and b, it is, in fact,, 
a characteristic of the surface. Therefore, making $(nd 2 ) a minimum 
is equivalent to making 

S{a— (a + by) } 2 

a minimum. That is to say, we may regard our method in another 
light. We may say that we form a single-valued relation 

x = a -f- by 

between a pair of associated deviations, such that the sum of the 
squares of our errors in estimating any one x from its y by the 
relation is a minimum. This single-valued relation, which we may 
call the characteristic relation, is simply the equation to the line of 
regression RR. There will be two such equations to be formed 
corresponding to the two lines of regression. 

The idea of the method may at once be extended to the case of 
correlation between several variables x u x % , x$, &c. Let n be the 
number of observations in an array of x\ 9 s associated with fixed 
values X 2 , X 3 , X 4 , &c., of the remaining variables, let <rx be the 
standard deviation of this array, and let d be the difference of its. 
mean from the value given by a regression equation 

Xx = a 12 X 3 -f a l ^K^-)ra u X i + . 

Then, as before, we shall determine the coefficients a 12 , a 13 a 14 , &c., so 
as to make S nd 2 a minim am. But this is again equivalent to 
making 

S{#x — (a n x i -) r a n X3+a li x i +.... )} 2 

a minimum for 

S{a?i — (a n x 2 + ^ 13 ^ 3 T a -h.. . • ) } 2 = S (no. x 2 ) + S (nd 2 ). 

Hence, we may say that we solve for a single-valued relation 
X\ a\iXo ~f” ^i3<^3 ~f” a\\X± ”1“ • • • • 

between our variables; the relation being* such that the sum of the 
squares of the errors made in estimating X\ from its associated 
values £c 2 , a* 3 , &c., is the least possible. In the case of normal correla- 
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tion this “ characteristic relation ” must become the “ equation of 
regression ” which gives the means of any aq-array, as only in this 
way can be made a minimum, i.e., zero. 

It might be said that it would be more natural to form a “ charac¬ 
teristic relation ” between the absolute values of the variables and 
not their deviations from the mean. This may, however, be most 
conveniently done by working with the mean as origin until the 
characteristic is obtained, and then transferring the equation to zero 
as origin. It would be much more laborious and would only lead to 
the same result if zero were used ab initio as origin. 

We may now proceed to the discussion of the special cases of two, 
three, or more variables. The actual formuke obtained are not, it 
will be found, novel in themselves, but throw an unexpected light 
on the meaning of the expressions previously given by JBravais* for 
the case of normal correlation. 

(1) Case of Tioo Variables .—Since x and y represent deviations 
from their respective means, we have, using S to denote summation 
over the whole surface, 


S(tf) = S (y) = 0. 

The characteristic or regression equations which we have to find are 
of the form 

a? = %-f . rjx 

y rz: a% -f b 2 X J. ^ ' ’ 

Taking the equation for x first, the normal equations for esq and l\ 
are 

S(» = 'Na 1 -\-b l 8(y) | ^ 

8(xy)=a l 8(y)+b 1 8(y0l . . 

Y being the total number of correlated pairs. From the first of 
these equations we have at once 


From the second 


«i = 0. 


&i = 


8 (xy ) 

s (zr) 


To simplify our notation let us write 

8(x 2 ) = N**. 8f-= 

S (xy) = ~Nr(Ti(T 2 . 




<*i and a 2 are then the two standard-deviations or errors of mean 

* “ Memoires par divers Savants,” 1846, p. 255, and Professor Pearson’s paper 
on £C Regression, Heredity, &c.” £ Phil. Trans./ A, vol. 187 (1896), p. 261 et seq. 
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square, r is Bravais’ value of tlie coefficient of correlation. Re¬ 
writing hi in terms of these symbols, we have 

= .. (3). 

°2 

Similarly, Og = 0, b 2 = r —.. (4). 


But the expressions on the right of (3) and (4) are the values 
obtained by Bravais on the assumption of normal correlation for the 
regression of x on y, and the regression of y on x. That is to say, 
the Bravais values for the regressions are simply those values of by 
and & 2 » which make 

S(a;— bxy) 2 and S (x—btyf 

respectively minima, whatever be the form of the correlation between the 
two variables . Again, whatever the form of the correlation, if the 
regression be really linear, the equations to the lines of regression are 
those given above (as we pointed out in the introduction). This 
theorem admits of a very simple and direct geometrical proof. 

Let n be the number of correlated pairs in any one array taken 
parallel to the axis of x , and let 0 be the angle that the line of 
regression makes with the axis of y. Then, for a single array, 

S (xy) = 2 /S(u?) = ny 2 tan 0 , 

or extending the significance of S to summation over the whole 
surface, 

S (xy) = N* tan 0 <t 2 2 9 

that is, 

tan 0 = r — . 

<r 2 

In any case , then , inhere the regression appears to be linear , Bravais 9 
formula} may be used at once without troubling to investigate the 
normality of the distribution . The exponential character of the surface 
appears to have nothing whatever to do with the result . 

To return, again, to the most general case, we see that both 
coefficients of regression must have the same sign, namely, the sign 
of r. Hence, either regression will serve to indicate whether there is 
correlation or no, for there is no reason, a priori , why the values of 
hi and b 2 , as determined above, should be positive rather than 
negative. But, nevertheless, the regressions are not convenient 
measures of correlation, for, on comparing two similar cases, we may 
find, say, 


2 o 


VOL. LX. 


hi > V i, 


b% < b'if 





482 Mr. G. U. Yule. On the Significance of Bravais ? Formulae 


where b Y b 2 , V-fi* are the regressions in the two cases. To which 
distribution are we, in such a case, to attribute the greater corre¬ 
lation P Bravais’ coefficient solves the difficulty, we may say, in 
one way, by taking the geometrical mean of the two regressions as 
the measure of correlation. It will still remain valid for non-normal 
correlation. But there are other and less arbitrary interpretations 
even in the general case. 

Suppose that instead of measuring x and y in arbitrary units we 
measure each in terms of its own standard deviation. Then let us 
write 


x y 

- = P — 

( 7 1 < 7 2 


(5), 


and solve for p by the method of least squares. We have omitted a 
constant on the right-hand side, since it would vanish as before. We 
have, at once, 


_ S J>2/) B i- r 
S(f) 


( 6 ), 


That is to say, if we measure x and y each in terms of its own 
standard deviation, r becomes at once the regression of x on y> and 
the regression of y on x. The regressions being, in fact, the funda¬ 
mental physical quantities, r is a coefficient of correlation because it 
is a coefficient of regression.* 

Again, let us form the sums of the squares of residuals in equations 
(1) and (5). Inserting the values of b 1} b 2} and p, we have— 


S (y-b 2 xy = N<7 2 2 (l-r 2 ) 



1 




j 


( 7 )- 


Any one of these quantities, being the sum of a series of squares, 
must be positive. Hence r cannot be greater than unity. If r be 
equal to unity, or if the correlation be perfect, all the above three 
sums become zero. But 


can only vanish if 


-+-Y 
, 01 02 ./ 


x y 
-±- = 0 
0"l 02 


in every case, or if the relation hold good, 

* That the regression becomes the coefficient of correlation when each deviation 
is measured in terms of its standard-deviation in the case of normal correlation has 
been pointed out by Mr. Francis Galton. Vide Pearson 1 Phil. Trans./ A, vol. 187, 
p. 307, note. 
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X i $2 *^3 

yi ~~ 2/2 ~ 2/3 


=b 


El 

ff 2 


'( 8 ), 


the sign of the last term depending on the sign of r. Hence the 
statement that two variables are “ perfectly correlated ” implies that 
relation (8) holds good, or that all pairs of deviations bear the same 
ratio to one another. It follows that in correlation, where the means 
of arrays are not collinear, or the deviation of the mean of the array 
is not a linear function of the deviation of the type, r can never be 
unity, though we know from experience that it can approach pretty 
closely to that value. If the regression be very far from linear, some 
caution must evidently be used in employing r to compare two diffe¬ 
rent distributions. 

In the case of normal correlation, vi- r 2 is the standard devia¬ 
tion of any array of the jx variables, corresponding to a single type of 
2 /’s. vis is similarly the standard deviation of any array of 
the y variables, corresponding to a single type of x’s. In the general 
case, the first expression may be interpreted as the mean standard 
deviation of the ^-arrays from the line of regression, and the second 
expression as the mean standard deviation of the y -arrays from the 
line of regression. Otherwise we may regard 

a I V 1— r 2 


as the standard error made in estimating x from the relation 


« = hy, 

and 

< T 2 s/l—r 3 


as the standard error made in estimating y from the relation 

y = hx, 

these interpretations being independent of the form of the correla¬ 
tion. 

(2.) Case of Three Variables, 

Let the three correlated variables be X x , X 2 , X 3 , and let .x l9 x 3 
denote deviations of these variables from their respective means. Let 
us write, for brevity, 

sOi 2 ) = xv, so, 2 ) = :nv 

so* 2 ) = n<t 3 2 

SfXiXz') = Xr 12 ^i^2 
S(a? 2 a? 3 ) = Xr 23 <T 2 <r 3 
SC^i) = Nr 3X ^i- 


2 o 2 
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Oar characteristic or regression-equation will now be of the form 

x l = b u x 2 -f bwx 3 . (9), 

l 12 and &i 3 being the unknowns to be determined from the observations 
by the method of least squares. I have omitted a constant term on 
the right-hand side, since its least-square value would be zero as 
before. The two normal equations are now— 


SOitf 2 ) = b l2 S(x 2 2 }+ &i 3 S(afea> 8 ) 


SO^s) = b l2 S(x 2 x 3 ) +biS(%a), 

or replacing the sums by the symbols defined above, and simplify - 


mg— 





r X2 (Ti — bi 2 (T 2 -\-bi Z r 2Z G Z 'f 
f*i 3 °i ss b i2 r 2z cr 2 b xz a z j 

. (10), 

whence 

7 r 12 —r l2 r 2Z 

*ri 



I-r.' 


. (11). 


^ _ ^13 — r X2 r 2Z 

fl| 


^13 — 2 

1—r 23 - 

«3J 



That is, the characteristic relation between x x and x 2 x z is— 


Xi 


r 12 —r 13 r 23 a\ 
1 — r 2 z a 2 




^13 “ r 12^*83 

1—r» a 



( 12 ). 


Now Bravais showed that if the correlation were normal , and we 
selected a group or array of Xx’s with regard to special values h 2 and 
7*3 of Xz and a? 3 , then h x being the deviation of the mean of the selected 
Xi’s from the Xx-mean of the whole material, 


h x — b l2 h 2 -\-biJi 2 , 

where b l2 and 5 l3 have the values given in (11). But evidently the 
relation is of much greater generality; it holds good so long as h x is 
a linear function of h 2 and h z , whatever be the law of frequency. 

Further, the values of b X2 and b xz above determined, are, under any 
circumstances, such that 

Sv 2 = S [x x — (b X2 x 2 4* bizxf) ] 2 , 

is a minimum. If we insert in this expression the values of b X2 and 
5 x 3 from (11), we have, after some reduction, 

so 2 ) = nv (l- r jl±T}lr:. 

l 1— r m * 


( 13 ), 
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say, In normal correlation B** is the standard deviation of 

an Xi-array, corresponding to any given types of X 2 and X 3 . In 
general correlation it may be regarded as the mean standard deviation 
of the Xx-arrays from the plane 

or as the standard error made in estimating x x from x 2 and x 3 by 
relation (12). 

The quantity R is of some interest, as it exactly takes the place of 
r in the residual expressions (7). Rx may, in fact, be regarded as a 
coefficient of correlation between x x and (x 2 x 3 ) ; it can only be unity 
if the linear relation (9) or (12) hold good in every case. 

The quantities bn, b i2 , &c, (the others may be written down by 
symmetry), may be termed the net regressions of x x on a? 2 , on a? 3j 
&c. If we write 2 for 1 and 1 for 2 in the value of b l2 , we have 

7. ^12 — r 13^*23 ffa 

0 2l ------, 

1—^23 

b 2 1 being the the net regression of x 2 on x x . In normal correlation, 
bn and b 2l are the regressions for any group of X/s or X 2 ’s associated 
with a fixed type of X 3 ’s. Hence, in this case (normal correlation), 
the coefficient of correlation for such a group is the geometrical mean 
of the two regressions, or 

_ 9*13 — 9*13 y 23i 

/Jn ~ v/Cl-V) (l-r^) 

a quantity that may be called the net coefficient of correlation 
between x x and The similar net coefficients between x x and x 3t 
x% and x 3 , may be written down by interchanging the suffixes. 

In normal correlation p 12 is quite strictly the coefficient of correla¬ 
tion for any sub-group of Xx’s and X 2 ’s, whatever the associated type 
of X 3 ’s. In generalised correlation this will not be so, and p n can 
only retain an average significance. 

The method does not appear to be capable of investigating changes 
in the net coefficient as we pass from one type to another, but it may 
be noted that whatever the form of the correlation, p l2 retains three 
of the chief properties of the ordinary coefficients : (1) it can only be 

# My quantities, h l2 , bn, &c., were termed by Professor Pearson (“ Regression 
&c.” ‘ Phil. Trans., 5 A, yol. 187 (1896), p. 287), “Coefficients of double regression, 5 ’ 

and quantities like b n Q , b xz , &e., “ coefficients of double correlation.” My 

quantities p he did not use. Having named the p’s “ net correlation,” it seemed 
most natural to rename the 5’s “ net regressions,” as the 5’s and p’s are correspond¬ 
ing quantities. 

Some of my results given above were quoted by Professor Pearson in his paper 
(loe. cit notes on pp. 268 and 287). 
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zero if both net regressions are zero; (2) it is a symmetrical func¬ 
tion of the variables ; (3) it cannot be greater than unity; for, 

t>y ( 13 )» 

r^ + rj— 2r 12 r 23 ?3i < l—r 23 2 , 

or adding r 13 2 r 23 2 to both sides, and transferring r 13 2 to the right-hand 
side 

0*12— ^IS^) 2 < (1 — T i3 2 ) (1 — T 22 ) . 

If any two coefficients, say ri 2 r 13 , be supposed known, the inequality 
we have used above will give us limits for the value of the third. 
Throwing it into the form 

0 * 23 “ r 12^13) 2 < 1 r 12 V 13 2 —?’ 12 2 —r 13 2 , 

we have r 23 must lie between the limits 

y i2?'i3 ± ^la^is 3 — Tif— rtf -j-1. 

The values of these limits for some special cases are collected in 
the following table :— 


Values of r 12 and r 13 . 
r 12 = ri 3 = 0 
r l2 = r 13 = ±1 
r 13 = +1, r 13 = — 1 
n 2 = 0 , r 13 = ±1 
= 0, r 13 = ±r 

■ri2 = ri3=±r 

^*12 = 4* r, r 13 = —r 

r 12 = r 13 = + 0'5 “ 0*707 

r 12 = + a/ 0‘5 r 12 = — x/o 7 ^ 


Limits of r 23 . 

0 

+.1 
— 1 

0 

1 and 2r 2 —*1 
2r 2 —1 and —1 
0 and 1 


One is rather prone to argue that if A be correlated with B, and B 
with C, A will be correlated with 0. Evidently this is not necessary. 
A may be positively correlated with B, and B positively correlated 
with 0, but yet A may, in general, be negatively correlated with 0. 
Only, if the coefficients (AB) and (BC) are both numerically greater 
than 0*707, can one even ascribe the correct sign to the (AC) corre¬ 
lation. 

It is evident that one would, in general, expect to make a smaller 
standard error in estimating from the two associated variables x% 
and a’ 3 , than in estimating it from one only, say x 2 . But it seems 
desirable to prove this specifically, and to investigate under what 
conditions it will hold good. The necessary condition is— 


^i2 2 + r 13 2 —2r 12 r 23 n 3 ■ „ 2 
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that is, 

n 2 2 +n 3 2 — 2 r 12 r 13 r 2 3 > r 12 8 —n 2 2 9 *i 3 2 , 
or 

0*13 —n 2 ^23) 2 > 0. 

But (ri 3 —*ri 3 r 2 3 ) is the numerator of p lz , the net coefficient of corre¬ 
lation between Xi and x$. Hence the standard error in the second 
case will be always less than in the first, so long as is not zero. 
The condition is somewhat interesting. 

To take an arithmetical example, suppose one had in some actual 
case 

9*12 =: 4* 0*8 

9*23 — +0*5 r l3 = +0*4. 

One might very naturally imagine that the introduction of the third 
variable with a fairly high correlation coefficient (0*4) would con¬ 
siderably lessen the standard deviation of the a+array; but this is 
not so, for 

0*4—(0*5 X 0*8) 

P|Q --- —N — 0 , 

<v/0*75 X 0*36 

so the third variable would be of no assistance. 

III. Case of Four Variables . 

This case is, perhaps, of sufficient practical importance to warrant 
our developing the results at length as in the last. 

If Xi, a? 2 , o? 3 , # 4 , be the associated deviations of the four variables 
from their respective means, the characteristic equation will be of the 
form 

®i. = + b iz x 3 + b u x .. (14). 

The normal equations for the b’s are, in our previous notation, 

9*12 ^l “ bi2(T 2 + b 1Z T 2Z (T Z + &i4?*21°‘4 
9*l3°'l = ^129*23^2 “b b\ Z (T Z + by]Xzi^i / 

9*14^1 ^ 5129*24^2 4 & 139*34°3 + frl4°4 ^ 


Hence 


9*12 

9*23 

9*24 

9*13 

1 

9*31 

9*14 

9*34_ 

I 

1 

9*23 

9*24 

9*23 

1 

9*34 

9*24 

9*34 

1 


and so on for the others, b lz , &c., we may call the net regressions 
of Xi on a? 2 , x 1 on x 3 , &c., as before. By parity of notation, we have 
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n 2 

^*23 

ru 


n 3 

1 

r 3 i 1 


ru 

r 3 4 

1 ] 

a 2 

i 

n 3 

ru 

a l 

n 3 

i 

rsi 


ru 

^34 

1 



and we may again call 

pn — ^/biob^i, 

the net coefficient of correlation between x x and x 2 . Expanding the 
determinants, we have, in fact, 


, ^* 12(1 — ^ 34 2 ) +ria(r S 4 ru~r 2H ) _ 

^[(1 — r 3i 2 ) + ^23(^34^24-^23) + ^24h 23 r 34~ ^24)][(1 -**34 2 ) + ^(VM —**1 3 ) + -^m)] 

. -(16). 

There are six such net coefficients, p l2 , p 13 , p lh p 23 , p 2i , p H . The 
above values of the regressions are again those usually obtained on 
the assumption of normal correlation.* The net correlation p l2 
becomes, on that assumption, the coefficient of correlation for any 
group of the x x x 2 variables associated with fixed types of x 3 and x 4 . 

If we write 

U = Xi — ( 612*2 + 613*3 + 614 * 4 ), 
we have, after some rather lengthy reduction, 


where 


R 1 2 


f ^ 12 2 + r 13 2 + - >y/r 34 2 - r 23 V 14 2 - r 13 V 24 2 1 

1 ~ 2 (r 13 ^ 14 r 34 + r^r u r 2i + r n r u r 23 ) + 2(r n r u r^ 4 + r n r u r 23 r. 2i + r n r^r. 2A r :u ) J * 
1—^s 2 - ^34* - W 


In normal correlation, a- lx /l — Ki 2 is the standard deviation of all Xj- 
arrays associated with fixed types of x 2 , * 3 > and x 4 . In general corre¬ 
lation, it is most easily interpreted as the standard error made in 
estimating x u by equation (14), from its associated values of x 2i x 3 , 
and x 4 . 

As in the case of three variables, the quantity It may be considered 
as a coefficient of correlation. It can range between +1, and can 
only become unity if the linear relation (14) hold good in each indi¬ 
vidual instance. 

We showed at the end of the last section that the standard error 
made in estimating x x from the relation 

X x " 612*2 4“ #i3*3 


* Professor Pearson, “Eegression, Heredity, and Panmixia.” ‘Phil. Trans.,’ 
A, vol. 187(1896), p. 294. 






Mathematical Contributions to the Theory of Evolution. 489 

was always less than the standard error when only x 2 was taken into 
account, unless 

Piz = 0 . 

We may now prove the similar theorem that when we use three 
variables, x 2 , x 3 , a? 4 , on which to base the estimate, the standard error 
will be again decreased, unless 


pu = 0. 

The condition that S(% 2 ), in our present case, shall be less than 
S (r 2 ) in the last, is, in fact, 

frj+rj+ru'-rvfr 

^ ~2(r 13 r 14 r 34 +r 12 r 13 r 23 +r 12 r 14 r 24 ) >(l-~r 23 2 ) 

l 4- 2 (r 12 r l4 r 23 r 34 -f r 14 r 13 r 24 r 23 -f r n r 13 r 24 r 34 ) J 
^ (?'n "f* r\z 2 r 12 ri 3 r 23 ) (l—r 23 2 —r 24 *—+ 2 r 2 3 r 24 r 34 ). 

This may be finally reduced to— 

O'u—- r 13 r 34 —r 12 r 24 —r 14 r 23 2 +r 13 r 23 r 24 4- r 12 r 23 r 34 ) 3 > 0, 
that is p l4 2 > 0. 

The treatment of the general case of n variables, so far as regards 
obtaining the regressions, is obvious, and it is unnecessary to give it 
at length. 

We can now see that the use of normal regression formulae is quite 
legitimate in all cases, so long as the necessary limitations of inter¬ 
pretation are recognised. Bravais’ r always remains a coefficient of 
correlation. These results 1 must plead as justification for my use of 
normal formulae in two eases* where the correlation was markedly 
non-normal. 


“ Mathematical Contributions to the Theory of Evolution.—On 
a Form of Spurious Correlation which may arise when 
Indices are used in the Measurement of Organs/’ By 
Karl Pearson, F.R.S., University College, London. Re¬ 
ceived December 29, 1896,—Read February 18, 1897. 

(1) If the ratio of two absolute measurements on the same or 
different organs be taken it is convenient to term this ratio an index . 

If u =fi(x, y ) and v =/ 2 (z, y) be two functions of the three variables 
x, y , z 7 and these variables be selected at random so that there exists 
no correlation between x 7 y 7 y 7 z 7 or z,x 7 there will still be found to 

* ‘Economic Journal/ Dec., 1895, and Dec., 1896, “ On the Correlation of Total 
Pauperism with Proportion of Out-relief.* 1 * 



